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(57) Genes at least partially composed of stochastic synthetic polynucleotides are simultaneously produced within 
the same medium, the genes thus obtained are introduced into host cells; the independent strains of modified 
host cells enclosing said genes are simultaneously grown, so as to clone the stochastic genes and to promote the 
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said property. Analogously, stochastic DNA or RNA sequences showing at least a desired property can be 
produced by screening and selecting strains of modified host cells so as to identify the strains enclosing in their 
genome said DNA or RNA sequences. 
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Technique 

for obtaining DNA, RNA. PeP t ' de *' ? cells 
proteins, through use ^"^Sn^hSe RNA.. 
Staining flan..^ cepable , * is to say. by 



of oligonucleotides without -^Sw£ 
manner to form fragments of f «ochest.c DN A. 
Sowed by ligation of these 8 
previously notably a 

70 The "^""rniS E«e em retutts have been 
bactenal plasmid. Exceiiem resu express j 0 n 
obtained using the plasm.d pUC8 as the exp 

vector.' he viral DNA or a 

The expression vector can a so be v.ra. w 



aecond mode mentioned abova.^ . po 



uSS^ 
The invention aims in. £Mutor« £e£0 

of stochastic •^*2^j!3S M ou*y. 
in a fashion to permit obtain ng s.mu 

15 after transcription and * ^SojOOO) of 

vary large number (on the host 

among the ^JjJ ^desired propertias. 
of them produce-proteins wnn a 

for example structural. •^^tiwJf' 

^gnS« 8 ofDN^g^ 
-j^oesOifiS KUf rhflmt rn1 " 1Qcn? • 

30 biological 



85 



•pmnd mode menuoncu oww^# — - . 
80 Sorgonucleotideswhichform.groupof 

^cTanyl^sults are obtained by utilizing 
theS^nggmupofpalindromicoctamers. 

5- GGAATTCC 3' 
5'GGTCGACC3' 
5' CAAGCTTG 3' 
S' CCATATGG 3' 
5- CATCGATG 3' 

tt is also possible to use oligonucleotides which 
?«?m «omuo of palindromic heptamers. 
Vei g^d results are obtained utilizing the 

foKg group - «» ,inaromiC hept8mer8 ' 



90 



95 



6'XTCGCGA3' 
5' XCTGCAG 3* 
5'RGGTACC3' 



- 1I 8SSBBfcl««*. invent^ is ope" to a 

number of applications in very many 
^eas of science, industry and med.c ne. 
" e process for production of pap . des or 
35 PoWPeptides -o^ 

?n h JneMmUedium,gane S whichareatleast puri f«s the 'ransf om^ng ^ 

Obtained into host cells. that one 



10 0 where X- A. G. C ° r J. »"J R " J ^ procedures 
According to a method tt ut.UM tne. p 

porif.es the trantform^r^DNAoMhe^ ^ 



40 obttln^^^^-^^oftht 

,imU ?'^eo?o SSSSSb «— - eneS ,n 
,r "7 . T/nnir S?s to done the stochestic genes 
such a manner ao ■» w **w -retains 

end to obtain the P^^J^SSS^ ** 



host cells ootameu ^ - • M one 
ebove. then the punfied ON A « cut oy 

octamer, or bopt-mersb^bwrrtf^mt^ ^ 

then on "^^^ U D Z j 80 thus obtained 
linearized stochastic DNA TrBgni 

^ T 4 D NA.igase.,n.ucM 



In a first mooe o. . t- »" ^.^.c 
st chastic genes are pr f^..^*, 

55 eopolvm^i^^tdTIWnll- 
deoxyphosphonucle tides, ion vect0 r. 

two ends 1 an initielly hn J""*^ jn 8uch , 
f iiowedbyf rmat. " of cohesive n f ONA 
fashion est f m »« 0 f 8 ^ fi p ^ 0 n vector 
60 constitut d by a molecul ^ w 3 > .„ d s 



sequences, tnis new «. w— h 
. number of stochastic genes ^J« n J^ tnen 
number of genes in the mm JJ-"** : ° 0NAt 

^esthisn^ 
120 transf rm the host censano*. . d 

finallyutiliz sscreening •^•^?J 08t cells 
iso ,.tes th new c'onas of transf ormedljo ^ ^ 



isolates th new cones u.u t , eB st 
end finally th se are cutovat diPj new 
ne peptide or polypeptide, for example. 

125 protein. ..♦h-Miteri n for selection 

The ProP^nS "s «n SE «P^ of the 
of the d nes fh stalls canoe 8 - ive n 
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peptides end/or polypeptides, the said property can 
be the capacity to catalyse a sequence f reactions 
leeding from an initial group of chemical 
compounds to at leest one target compound. 

5 With the aim of producing an ensemble 
constituted by a plurality of peptides and 
polypeptides which ere ref lexively autocatalytic. the 
said property can be the cepacity to catalyse the 
synthesis of the same ensemble from amino acios 

1 o end/or oligopeptides in an eppropriate milieu. 
The said property can also be the capacity to 
modify selectively the biological or chemical 
properties of a given compound, for example the 
capacity to selectively modify the catalytic activity of 

15 a polypeptide. ,„ „ 

The said property can also be the capacity to 
stimulate, inhibit, or modify at least one biological 
function of at least one biologically active 
compound, chosen, for example, among the 

20 hormones, neurotransmitters, adhesion i factors, 
growth factors and specific regulators of DMA 
replication and/or transcription and/or translation of 

The said property can grmnlW hft the capacity of. 

The inventionllso has as its object the use of the 
peptide or polypeptide obtained by the process 
specified above, for the detection and/or the titration 

of a ligand. __,«« _i 

30 According to a particularly advantageous mode of 
carrying out the invention, the critenon for selection 
of the clones of transformed host cells is the 
capacity of these peptides or polypeptides to 
simulate or modify the effects of a biologically 
35 ective molecule, for example, a protein, and 

screening and/or selection for clones of transformed 
host cells producing at least one peptide or 
polypeptide having this property, is carried out by 
preparing antibodies against the ective molecule. 
40 then utilizing these antibodies after their 

purification, to identify the clones containing this 
peptide or polypeptide, then by cultivating i the 
clones thus identified, separating and purifying the 
peptide or polypeptide produced by these clones. 
45 and finally by submitting the peptide or polypeptide 
to an in vitro assay to verify that it has the capacity 
to simulate or modify the effects of the said 

molecule. , .... 

According to another mode of carrying out the 
50 process according to the invention, the IP'OP 8 /** 
Serving as the criterion of selection is that of having 
st least one epitope similar to one of the epitopes of 
e given antigen. . . 

The invention carries over to obtaining 
55 polypeptides by the pr cesssp cifi d above and 
utilizable as chemotherapeuticelly active 

substances. ... 

in particular, in the case where the said i anbg r « 
EGF.th invention permits obtaining polypeptides 
60 usable for chemotherapeutic treetment of 

^'IdTnflio. variant fthe procure on. 
id ntffjeiAieolates^^ 



ehr mata graphv against antibodie s corresponding 
t a protei nexprg§§sd_bY IhP iral P art ot the 

DNA hybrid. . 
For example, in the case where the natural part of 
70 the hybrid DNA contains a gene expressing B- 
gslactosidase. one can edvantageously identify and 
isolates the said clones of transformed host cells by 
sffinity chromatography against anti-B- 

galactosidase antibodies. 
75 After expression and purification of hybrid 

peptides or polypeptides, one can separate and 
isolate their novel parts. 

The invention also applies to a use of the process 
specified above for the preparation of a vaccine; the 
80 application is characterized by the fact that 
entibodies against the pethogenic agent are 
isolated, for example antibodies formed after 
injection of the pethogenic agent in the body of an 
snimal capable of forming antibodies against :th.s 
85 agent, end these antibodies ere used to identify the 
clones producing at least one protein having at least 
one epitope similar to one of the epitopes of the 
pathogenic agent, the transformed host cell 
corresponding to these clones are cultured to 
90 produce these proteins, this protein is isolated and 
purified from the clones of cells, then this protein is 
used for the production of e vaccine against the 

pathogenic agent. 
For example in orderto prepare an anti-HVB 
95 vaccine, one cen extract and purify at least one 
capside protein of the HVB virus, inject this protein 
into an animal capable of forming entibodies 
against this protein having at least o" 6 epitope 
similar to one of the epitopes of the HVB virus, then 
100 cultivate the clones of transformed host cells 
corresponding to these clones in a manner to 

produce this protein, isolate and purify the protein 
from culture of these clones of cells and ut.l« the 
protein for the production of en anti-HVB vaccine. 
1 05 According to an advantageous mode of carrying 
out the process according to the invention, the host 
cells consist in bacteria such as Escherichia coli 
whose genome contains neither the natural gar .e 
expressing B-galactosidase. nor the *BG «ene. that 
110 istosav Z-.EBG-E coli. The transformed cells are 
cXedin the presence of X gal and the indicator 
IPTG in the medium, and cells positive for B- 
gelactosidase functions are detected; thereafter, the 
transforming DNA is transplanted into an 
1 1 5 appropriate clone of host cells for large scale culture 
to produce at least one peptide or PO'VP^' 0 " 6 - 

Ibftpwpertv serving asJtaa rritrrinn fnr Mlectioa 
of thetrapsl Q i mprl hnM Ettlls ca oJ l« ° » e thp 
-cSp^rtyofth^ 
120 b y the culturao f ^ to bind t0 a give 

,C - Th?sc5 mpound cen be in particular chosen 
advantageously among peptides. P^^ nd 
proteins notably among prot ins regulating the 
125 transcription activity fONA. . 

n V h.»th«r hanrt thpinif l rnmpni i ndcana'so pe 

J^JS^Ki mmS^^ proteins 
" the inv ntion has aisles its b.ect th se ^pro 
which are obtain d in the case ^wh r the property 



transformed host cells consist in the capacity f 

controlling transcription activity or 
to DNA and RNA sequences. 

10 includes utilization of ^£S!tS^ of 

.econd cese "--•^'jSIJSSSoilA. in 
transcription or replication ota sequen 
a cell containing the sequence of DNA, ana 

15 expressing this ' IJJ*J- bject 8S we ll a process of 

r^The invention nas as «s uui»w _. .i..„a«n« 

production of DNA. charactered bv*myjIMHHi»- 

pro 



2( 



urorliirtiTT-j" * h » f me medium^ — ■_ 

S^^^^^btained are 
-JoTynucleotides. in that the 8 e1 *^ fl Vensemble of 
Sauce^nto host cells to produce an ' •™J mw 
imroou^ screening and/or 



at leaj 



100 



^BjgucOTto host ceils to prwuuv* -. - — 
Sormed host cells. ^ 
.election on this ensemble is carnea oui w 

SSX* cells "^"J^^^i one 
stochastic sequences of ON A having at t lea* 

production in the can** mcdiucn, of *."« « MO 

capacity to rmn'y * giwn cnrrmnw 

45 mpacitv to be atrjaaf iff h inven tion will 

-fewthe^cSs^^ its 

procedures to carry out the in 
Jenes. and the introduction f « h ^ o ^ S D8Cte ria. 
bacteria to produce clones of transforms* oacr 

55 « Direct Synth sis nan Expression Vector, 
a) Linearization f the Vector „ 
30 micr grams, that is. approximately ^ 10 

J2 ~les 9 f the P UC8 ^^SSS^ ™ 
linearis dbyincubat. nf ^houwi"' 

60 unit, of the Pstl '*^j^f*'?^p^^^butfer^T1ire 

linearized vect r is twet d I w«n jn ^ ^ 

then precipitated .n ^ethan J ™££ p , in 8t8nd ard 
30 1 and loaded onto a 0.8% agaros^ 9 f ^ ^ ^ 



three hours, the ^^^^SS^- 
precipitated in ethenol. and tak n up m ju i o 

b) Stochastic Synthesis Using the Enzyme Terminal 

hours at 3rc with 30 .unto »£SlP. 1 

75 ^Srs'tiO" of dTTP is chosen «n , orde r to reduce 
Z frequency of "stop" codons . j the 
corresponding messenger RN * * "^J^ed 
although somewhat less *"2S^K«n for 

80 ^^^^B^ 

90 SaSagssSSsr 

in the presence of oATP ' ™" Separated by a 
ends of the polymers. 



Anus u» u»w — 

« Synth** .1 <n. Second M-^JSfl* 

W mM EDTA at «T for 10 ™ n "%£^ t *< 3 to 4" 
nucleotide triphosphate [^^SSte of the 

(^Transformation of Cogent Ctone s 
^100 to 200 ml of competent HB101 oto«« 
,20 concentration of 10" cells/ml. are ncuoaw ^ 

the stochestic DNA , P^^JhCI^B. 6 mM 
pr senceof6mMCaCI,,6mMTns M P kof 

MgCl.f rSOminut •J^ A 0 ^ mWlW . 
3 minutes at towpoj donj " NZY 
,25 f Howed by the •^^iSics.The 

cultur m d^'^Xt dat3rfor60 
transformed culture is ^cubat a ei ^ NZY 

minut s. then diluted to 1 0U re • After 

medium -"MWnj^jJ^^ amplified 
130 3-5 hours of incubation at 3/ v.. 
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culture .. centrifuged. end the P*»^ ,r ^ h °' med 
cells is lyophilysed end stor d et -70°C. Such e 
culture conteins 3x10' to 10« * nde P ende "* ^ 
trensformsnts, eech conteining e unique stochestic 
5 gene inserted into the expression vector. 

II) Synthesis of Stochestic Genes Sterling From 
Oligonucleotides Without Cohesive Ends. 
This procedure is besed on the feet ** 
10 polymerizetion of judiciously chosen P**™* 
oHgonucleotides permits construction of stochestc 
genes which heve no "stop" codon in eny of the s.x 
possible reeding fremes. while et the seme MUme 
assuring a belenced representation of triplets 
1 s specifying ell emino ecids. Further, end to evwd e 
1 5 w Uion 9 of sequence motifs in the P'Oteinswhu* 
reSult, the oligonucleotides can conteln a number of 
bases which is not e multiple of three.lhe , ex.mpl. 
which follows describes the use of * 
20 possible combinetions which fulfil these critene. 

a) Choice of e Group of Octemers 
The group of oligonucleotides following. 



25 



5' GGAATTCC3' 
5' GGTCGACC 3' 
5' CAAGCTTG 3' 
5' CCATATGG 3' 
5' CATCGATG 3* 



30 



is composed of 5 pelindromes (thus self 
im3e P ment.rysequences)whereitise. 
thet their stochestic polymerizetion does not 
generate eny "stop" codons. end specifies ell the 

35 emino ecids. . 

Obviously, one cen utilize other group of 

pelindromic octemers which do not Bene^eeny 
-stop" codons end specify ell the »™"o » c ' d ^°£ d 
in polypeptides. Clearly, it is elso possible to utilize 

40 non pelindromic groups of octemers. or other 
oligomers, under the condition thet their 
complements forming double strended DNA ere 

also used. 

45 b) Assembly of e Stochestic Gene From e Group of 

Octemers 

A mixture conteining 5 MB «»ch ot tne 
oligonucleotides indiceted ebove (previously 
phosphoryleted et the 5' position by e standard I _ 
50 procure) is reacts 

1 mM ATP. 10% polyethyleneglycol. end 100 unrts 
of T4 DNA ligese in the appropriate buffer at 13X for 
aix hours. This step carries out the stochastic 
polymerizeti n of the oligomers in the double 
S5 strend d stste end with utc hesive nds.The 
Sngpolymerserei. Isteo by passage over e 
mTcuirsiev (Biog.l P60) me veringth sewith 
20 to 100 lig mers. After concentreti n.this 
fracti n is egsin submitted t cetelysisor 
60 oolym.riz.ti n by T4 DNA ligese under the 
60 conditions describe ebove. There^er e. 
described ebove. th se polymers which heve 
assembled at leest 100 oligomers ere isoleted. 



c) Preparation Of The H stPlasmid 

The pUC8 expression v ctor is linearized by Smal 
enzyme in the appropriate buffer, as described 
above. The vector linearized by Smal does not have 

70 cohesive ends. Thus the linearized vector i is treated 
oy calf intestine elkaline phosphetase (CIP) at a level 
of one unit per microgram of vector in the 
appropriate buffer, et 37»C for 30 m.nutes. The CIP 
enzyme is thereafter inectivated by two success.ve 

75 .Son, with phenol-eh.oroforn, The neanzed 
and dephosphorylated vector is precipitated n 
ethenol.then redissolved in water at 1 mg/mi. 

d) Ligation of Stochastic Genes To The Vector 
80 Equimoler quantities of vector end polymers are 
mixed and incubated in the presence of 1000 un.ts 
of T4 DNA ligese. 1 mM ATP. 10% Po'vetMene 
glycol, in the eppropriete buffer, for 12 hours at 
iyC This step ligates the stochest-c polymers in the 
85 expression vector end forms double 

circular molecules which are. therefore, capable of 

transforming. 

Transformation of Competent Clones 
90 Transformation of competent clones is cemed out 
In the manner previously desenbed. 

Ill) Assembly of Stochestic Genes Starting From A 
GrouoOf Heptamers .. 
95 This procedure differs from that just dossed l.n 
that it utilizes palindromic heptamers which have 
variable cohesive ends, in place of ^^J*. 
sequences containing a smaller number of identical 

motifs. 

100 

a) Choice of a Group of Heptemers .„..__ 
It is possible, es an example, to use the following 
three palindromic heptamers: 



105 



5*XTCGCGA3' 
5' XCTGCAG 3' 
5' RGGTACC 3' 



where X- A. G. C. or T end B - A or T. -"dwhere 
110 polymerizetion cennot generate any stop codons 
ind forms triplets specifying ell the amino ec-ds. 

Cleerly it is possible to use other groups ot 
heptamers fulfilling these seme conditions. 

115 b) Polymerizetion of a Group of Heptamers 

This polymerization is carried out exactly m the 
feshion described above for octamers. 

c) Elimination Of Cohesive Extremities 
120 Thepolymersthusobtainedh»veoneunpa.r.d 

base on their two 6' xtrem.t.es. Thus, it is 

necesseryt eddthecomp. 

corresp nding3'extr mit.es Th ,sis earned uta 

Si ws: 10 micrograms fthedoubl 
125 polymerserer acted with 1 units ftheKlenow 
enzyme, in the pr sence of the IJ* 
deoxynucle tidephosphat s (200 mM) in a voiu 
of 1 00 ML at 4-C. for 60 minutes. The ^enzyme .s 
i„,ctivatd by phenol ^TS^Si 



nucleotides by differential pr "l*^™*, 
polymers ere then ligetec t the I D 
(previously linearized and ^ 

lowing the P^SS^^ 
c It is to be noted that the two * f 
5 were described utilize P^SSSS^ 
heptamers which •"^■^XitDMl- 
restriction enzymes. Thes arte. « a it 

most part, from the pUC8 e ^'°" n ^ mp iexity 
no is possible to eugment considerably we ro ? 
10 oVTn initial preparation of «ochas^c genw by 
proceeding in the following way: thai alasmia ur, 
Serived from the culture of 10' '"dependent 
trensf ormsnts obtained by one ofth. tjwj* 
« orocedures described above, is isolated. Arter tnis 
15 SSto purified, it is partially digested by 

friction enzyme (procedure o^^Mion 
restriction enzyme DN A « Jreated with 

of the enzyme. P- h ^«SSS3 Seating a 
->n T4 DNA liaase, which has tne enew u 

1 can be examined. » ttn «rate novel 

Finally, for all the P"*^^JjS5Srf 
synthetic genes, it is possible ^o use a nurn^ 

40 common techniques to ™ d '* 9 en " ^raion 

or severel [ WW^I Sve description, it is dear 

diff erent genes, by « n 'V m8 ^E* ™, 
.... «r «f olioonucleotides. i nis 



Clearly, in addition to the procedures which were 

loacihie to use a I other 

70 

single stranded o^o^rs ot t>N * o seg ments of 
oMWA*by MtaWished procedures u?generate 

75 genes. 



Screening Or Selection Of Cones Of Trensformed 

transfected cells '?*? ^^ Sorming or 
to isolate one or several cells whose >"T*° 

'^ssssssssssr 



90 si 




exploitable biochemicel propert.es. 

W. shell now describe, as •jJjSSlno or 
exemples. preferred P^^°^S«,«th. 
selection of clones of W8nsf ° r ,7~Se 0 ointof view 
1 00 novel proteins are of interest from the po.m o 
1 W ^industrial or medical "PP"" 1 !^ jde8 of 

One of these procedures mat." J» '™ £, on8 , 

producing, or ^.jfiS^^ES-- 
antibodies, by wwW,she h V^ of molecule of 
10 5 against a protein ° r ""° he r^-o ^ 

biochemical or medical '* 8 ' e *;T jmmU nogenic. 
molecule is. or hes been rendered, mm ub 

identify among the vw™*^ thQSe ^ose 

protein react witn www • n . w 

. n»utt of a structural homology wn '"' ■ 
tl^n the polypeptide synthesized by the 
between tne poyp v molecule. It is 



nudS or of oiiflonuc.^ides^s "5 po^^^^^^^le 



polymerization is "" ,e °° ' ve concentrations of 
es determined by the respective co he 
the nucleotides or oligonucleotides prese 

reaction mixture. „ th(M js can be utilized 

As indicted ebove. two met "<* 8 £ ); 

55 t c. « ene8 £ r Mrn S ?o£ectlyon. 

poiymerization can be earned ut u8 , y 

cloning expr ss. " ^I'^^ed sequentially 
linearized; or it is poss.ble P««J" f 
to the polymerization th n the hgat, 

60 polymers to the '^^^^r nnnfn nr ftliflD ° f 

penes ini^jjcenTwj^ 
R «;1^nKSK5T?3and expr ssed 



possible ,n tnis w-y - — " ope8 or entig enic 
proteins which beheve as e pn dp , 
determinents on the initial J™^™* ,JJ modu)8 te. 
proteins ere liable to s,mu»a ; -g-J^ be 
or block the effect of the ™™ . cr eening mey 
,20 clearthatthismeansofseert^ 

itself have very many ; P" 8 '™^ descfib e. as a 
biochemicel •PP ,,c8t '™^ f operation in 
n n limiting example, this first moo* 

a concrete cese: „ prote in 

12 5 EGF. ( Pid rmel growth factor * ^ 
present in the ^blo d .** ^ j$ oM8ined by 
growth of P' th specific receptor 

theinteracti n fEG . F r 3 a feDithe r,al cells, 
situat d in the membrane J «P"" e epare d by 
13 0 Antibodies directed against EGF are prep 



GB 2 183 661 A 6 



injecting animals with EGF coupled to KLH (keyh le 

limpet hemocyanin) to augment the 
immunogenic^ of the EGF. The anti-EGF 

antibodies of the immunized animals are punfied, 
5 for example, by passage over an affinity column, 
where the ligand is EGF or a synthetic peptide 
corresponding to a fragment of EGF. The punfied 
anti-EGF antibodies are then used as probes to 
screen a large number of bacterial clones lysedby 
10 cloroform, and on a solid support. The anti-EGF 
antibodies bind those stochastic peptides or 
proteins whose epitopes resemble those of the 
initial antigen. The clones containing such peptides 
or proteins are shown by autoradiography after 
1 5 Incubation of the solid support with radioactive 
protein A. or after incubation with a radioactive anti- 
antibody antibody. , ... 

These steps identify those clones, each of which 
contains one protein (and its gene) reacting with the 
20 screening antibody . It is feasible to screen among a „ 
very large «* colonies ofbactena l cells or 

viral p'«n"*« tfnr sample, on th 
*1 'OQO'flM and it is feasible to d *"** extremely, 
small quantities, on the order of 1 nanogram, of 
25 protein product. ThftTflfifter, tfw IdenrtWd clones are 
Bdmifid and lbfl nimrini so detected are punttedjn 
fnnventional ways . These proteins are tested in 
vitro in cultures of epithelial cells to determine if 
they inhibit, simulate, or modulate the effects of cur 
30 on these cultures. Among the proteins so obtained, 
some may be utilized for the chemotherapeutic 
treatment of epitheliomes. The activities of the 

proteins thus obtained can be improved by 
mutation of the DNA coding for the proteins, in 

35 ways analogous to those described above. A variant 
of this procedure consists in purifying these 
stochastic peptides, polypeptides or proteins, which 
can be used as vaccines or more generally to comer 
an immunity against a pathogenic agent or to 
40 exercise other effects on the immunological system, 
for example, to create a tolerance or diminish 
hypersensitivity with respect to a given antigen. In 
particular due to binding of these peptides, 
polypeptides or proteins with the antibodies 
45 directed against this antigen, h is clear that it is 
possible to use such peptides, polypeptides or 
proteins in vitro as well as in vivo. 

More precisely, in the ensemble of novel proteins 
which react with the antibodies against a given 
50 antigen X, each has at least one epitope in common 
with X, thus the ensemble has an ensemble of 
epitopes in common with X. This permits utilization 
of the ensemble or sub-ensemble as a vaccine to 
confer immunity against X. H is, for example, easy to 
55 purify ne r several of the capsidproteinsoftn 
hepatitis B virus. Th se proteins can then be 
ini cted into an animal, for exempl • "J*« 
the antibodies corresponding to the initial antigen 
canberec vered by affinity column pun^tiorh 
60 These antibodies may be used, as descnbed above, 
to identify clones producing at least one rpr je.n 
having an epitope resembling at least one of the 
pitopes of the initial antigen. After punficetion 
these pr teinsar used as antigens either al neor 

pc; ; n rnmhinptinn! with thP aim of conferring 



protection against hepatitis B. The final production 
f the vaccine does not require further access to the 
initial pathogenic agent. 
Note that, during the description of the 
70 procedures above, a number of means to achieve 
selection or screening-have been described. All 
these procedure may require the purification of a 
particular protein from a transformed clone. These 
protein purifications can be carried out by 
75 established procedures and utilize, in particular, the 
techniques of gel chromatography, by ion 
exchange, and by affinity chromatography. In 
addition, the proteins generated by the stochastic 
genes can have been cloned in the form of hybrid 
80 proteins ha\ ing, for example, a sequence of the P- 
galactosidase enzyme which permits affinity 
chromatography against anti-0-galactosidase 
antibodies, and allows the subsequent cleavage of 
the hybrid part (that is to say, allowing separation of 
85 the novel part and the bacterial part of the hybrid 
protein. Below we describe the principles and 
procedures Iff '^ion of pftntifl** nr polypeptides 
and the corresponding ge ™« according to a second 
mftthff H nf screening or selection based on the 
90 dete ction nf thr ~r«"fry ***** peptides QL 
pol ypeptides to catalyse a specific re action. 

As a concrete and non limiting example, 
screening or selection in the particular case of 
proteins capable of catalysing the cleavage of 
95 lactose, normally a function fulfilled by enzyme p- 
galactosidase (p-gal) will be described. 

As above described, the first step of the process 
consists in generating a very large ensemble of 
expression vectors, each expressing a distinct novel 
1 00 protein. To be concrete, for example, one may 

choose the pUC8 expression vector with cloning of 
stochastic sequences of DNA in the Pstl restriction 
site. The plasmids thus obtained are then 
Introduced in a clone of E coli from whose genome 
105 the natural gene for P-galactosidase.Z, and a 

second gene EBG, unrelated to the first but able to 
mutate towards p-gal function, have both been 
eliminated by known genetic methods. Such host 
cells (Z-, EBG*) are not able by themselves to 
1 10 catalyse lactose hydrolysis, and as a consequence to 
use lactose as carbon source for growth. This 
permits utilization of such host clones for screening 
or selection for P-gal function. 
A convenient biological assay to analyse 
115 transformed E coli clones for those which have 
novel genes expressing a P-gal 
the culture of bacteria transformed as descnbed in 
petri dishes containing X-gal in the medium. In this 
case, all bacterial colonies expressing a p-gai 
1 20 function are visualized as blue colonies. By using 
suchabi I gical assay, it is possible to detect even 

weak catalytic activity. The specific ectivjtyof 
characteristic enzym s ranges from 10 and 10.000 
productm I cul sp rs cond. 
1 25 Supposing that a pr tein synthesized by a 
" hasaweakspecifica^i ^ty on he 

order f one mol cul per 100 seconds, « reruns 
possibl t det ct such catalytic activity. In a petn 
dish containing X-gel in the medium and in the 
130 or s nee ofthe non metabolizable inducer IPTb 
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(isopr pyl-D-thiogalact side) visualization ^f a blue 
region requires cleavage f about 10 10 to 10 
molecules of X-gal per square millimeter. A 
bacterial colony expressing a weak enzyme and^ 

5 occupying a surface area of 1 mm* has about 10 to 
10 8 cells. If each cell has only one copy of the weak 
enzyme, each cell would need to catalyse cleavage 
of between 10'000 and 100 of x-gal to be detected, 
which would require between 2.7 and 270 hours. 

10 Since under selective conditions it is possible to 
amplify the number of copies of each plasmid per 
cell to 5 to 20 copies per cell, or even to 100 to 1 000, 
and because up to 1 0% of the protein of the cell can 
be specified by the new gene, the duration needed 

15 to detect a colony blue in the case of 100 enzyme 

molecules of weak activity per cell is on the order of 

0.27 to 2.7 hours. 

As a consequence of these facts, screening a very 
large number of independent bacterial colonies. 

20 each expressing a different novel gene, and using 
the capacity to express a p-gal function as the 
selection criterion, is fully feasible. It is possible to 
carry out screening of about 2000 colonies in one 
Petri dish of 10 cm diameter. Thus, about 20 million 

25 colonies can be screened on a sheet of X gal agar of 

1 square meter. 

It is to be noted that bacterial colonies which 
appear blue on X gal Petri dishes might be false 
positives due to a mutation in the bacterial genome 
30 which confers on it the capacity to metabolize 
lactose, or for other reasons than those which result 
from a catalytic activity of the novel protein 
expressed by the cells of the colony. Such false 
positives can be directly eliminated by purifying the 
35 DN A of the expression vector from the positive 
colony, and retransforming EBG'Ecoii host 
cells. If the fl-gal activity is due to the novel protein 
coded by the new gene in the expression vector, an 
those cells transformed by that vector will exhibit fl- 
40 gal function. In contrast, if the initial blue colony is 
due to a mutation in the genome of the host cell, It is 
a rare event and independent of the transformation, 
thus the number of cells of the new clone of 
transformed E coli capable of expressing gal 
45 function will be small or zero. 

^~The power of mass simultaneous purification of 
all the expression vectors from all the positive 
clones (blue) followed by ^transformation of naive 
bacteria should be stressed. Suppose that the aim is 
50 to carry out a screening to select proteins having a 
catalytic function, and that the probability that a new 
peptide or polypeptide carries out this function at 
least weakly is 10"*. while the probability that a 
clone of the E coli bacterial host undergoes a 
55 1 mutation rendering it capable of carrying out the 
I same f uncti n is 10-*. then it can be calculated that 
I among 20 milli n transformed bacteria which are 
screened, 20 positive clones will be attributable t 
thenov I gen sin xpressi n vectors which each 
60 carri s. while 200 positive clones will be the result ot 
g nomicmutati n.Masspurificati nofthe 
expression vectors fr m the total of 220 positive 
bacterial clones followed by ^transformation of 
naiv bact riawithth mixture of these expression 



cl nes consisting of all those bacteria transformed 
with the 20 expression vectors which code for the 
novel proteins having the desired function, and a 
very small number of bacterial clones resulting from 
70 genomic mutations and containing the 200 

expression vectors which are not of interest^AsmaN 
number of m lf^ o* purification of expression 
vectors from ^ fl fl itivp hartpr i al rn l nnies , fallow 
such rotranc^rmfttinn. allows the detection of very 
7* ^K^p^^^v^^^v positive for a desired 
catalytic"aa? yjl^xlespite a high background rate of 
' mutations in the host cells for the same function. 
Following screening operations of this type, it is 
possible to purify the new protein by established 
80 techniques. The production of that protein in targe 
quantity is made possible by the fact that 
identification of the useful protein occurs together 
with simultaneous identification of the gene coding 
for the same protein. Consequently, either the same 
85 expression vector can be used, or the novel gene 
can be transplanted into a more appropriated 
expression vector for its synthesis and isolation in 

large quantity. , 
It is feasible to apply this method of screening for 
90 any enzymatic f unction for which an appropriate 
biological assay exists. For such screenings, it is not 
necessary that the enzymatic function which is 
sought be useful to the host cell. It is possible to 
carry out screenings not only for an enzymatic 
95 function but for any other desired property for 
which it is possible to establish an appropriate 
biological assay. It is thus feasible to carry out, even 
in the simple case of P-gal function visualized on an 
X-gal Petri plate, a screening of on the order of 100 
100 million, or even a billion novel genes for a catalytic 
activity or any other desired property. 



Selection Of Transformed Host Cells 
On the other hand, it is possible to use selection 
1 05 techniques for any property, catalytic or otherwise, 
where the presence or absence of the property can 
be rendered essential for the survival of the host 
cells containing the expression vectors which code 
for the novel genes, or also can be used to select for 
1 1 0 those viruses coding and expressing the desired 
novel gene. As a non-limiting, but concrete 
example, selection for 0-galactosidase to™* 0 " - 
shall be described. An appropriate clone of Z fcou 
E. coli is not able to grow on lactose as the sole 
1 1 5 carbon source. Thus, after carrying out the first step 
described above, it is possible to culture a very large 
number of host cells transformed by the expression 
vectors coding for the novel genes, under selective 
conditions, either by progressive diminution of 
1 20 other sources of carbon, or utilization of lactose 
alone fr m the start. During the course of such 
sel cti n. in vivo mutagenesis by recombination, or 
by explicity recovering the expression vectors ana 
mutagenizing their novel genes in vitro by various 
125 mutag ns, r by any ther common technique, 
permits adaptive improvements in the capacity io 
fulfil the desired catalytic function. When both 
selection techniques and convenient bi assay 
techniques exist at the same time, as in the pr sent 
no c»<* it is oossible to use selection techniques 
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initially to enrich the repr sentati nofh st bacteria 
expressing the 0-gal function, then carry ut a 
screening on a Petri plate on X-gel medium to 
establish efficiently which are the positive cells. In 
5 (the absence of convenient bioassays, application of 
progressively stricter selection is the easiest route to 
J purify one or a small number of distinct host cells 
/ whose expression vectors code for the proteins 
^catalysing the desired reaction. 
10 It is possible to utilize these techniques to find 
novel proteins having a large variety of structural 
and functional characteristics beyond the capacity 
to catalyse a specific reaction. For example, it is 
possible to carry out a screen or select for novel 
1 5 proteins which bind to cis-regulatory sites on the 
DNA and thereby block the expression of one of the 
host cell's functions, or block transcription of the 
DNA, stimulate transcription, etc. 
For example, in the case of E coii, a clone mutant 
20 in the repressor of the lactose operon (i-) expresses 
P-gal function constttutively due to the fact the 
lactose operator is not repressed. All cells of this 
type produce blue clones on Petri plates containing 
X-gal medium. It is possible to transform such host 
25 cells with expression vectors synthesizing novel 
proteins and carry out a screen on X-gal Petri plates 
in order to detect those clones which are not blue. 
Among those, some represent the case where the 
new protein binds to the lactose operator and 
30 represses the synthesis of P-gal. ft is then feasible to 
mass isolate such plasmids, retransform, isolate 
those clones which do not produce p-gal, and 
thereafter carry out a detailed verification. 
A§ mtnttnnirri t hnvp thf> process c an be utilized 
35 in nrri ?r m create »h«n ^"htA. Dfll Qnl V exploitable 
proteins, but also RN A and DNA as products in 
themselves, having exploitable properties. This 
results from the fact that, on one hand, the 
procedure consists in creating stochastic sequences 
of DNA which may interact directly with other 
cellular or biochemical constituents, and on the 
other hand, these sequences cloned in expression 
vectors are transcribed into RNA which are 
themselves capable of multiple biochemical 
interactions. 



An Example Of The Us e Of The Procedure To Create 
And Select For Aj^N^Which Is Useful In Itself. 
This example illustrates selection for a useful 

50 DNA, and the purification and study of the 

mechanism of action of regulatory proteins which 
bind to the DNA. Consider a preparation of the 
oestradiol receptor, a protein obtained by standard 
techniques. In the presence of oestradiol, a steroid 

55 sexual h rm ne,th receptor changes 

conf rmati n and binds tightly t c rtain specific 
sequences in the genomic DNA, thus affecting the 
transcription fg n s implicated in sexual 
differential nandth c ntrol ff rtilityjjy 

60 fneubatino a mixture containing gentry dipt. Its. 

r c ptor.andaln rgftn >im ^ rftfrii ^^ 
DNA seouenc s inserted in the jf Rectors, f ol lowed. 

hy filtration f the V* ff cross a I 

membrane, one has a dir c t selection for those 
65 ItocFTastic DN A s qu nc s binding t the oestrogen 



receptor complex, where only those DNAs bound to 
a protein are retained by the membrane. After 
washing and elution, the DNA liberated from the 
membrane i s utilized as such to transform bacteria. 
70 After culture of the transformed bacteria, the vectors 
which they contain are again purified and one or 
several cycles of incubation, filtration and, 
transformation are carried o u^ as described above, 
These procedures allow the isolation of stochastic 
75 sequences of DNA having an elevated affinity for the 
oestradiol-receptor complex. Such sequences are 
open to numerous diagnostic and pharmacologic 
applications, in particular, for developing synthetic 
oestrogens for the control of fertility and treatment 
80 of sterility. 

* 

Creation And Selection Of An RNA Useful In Itself 
Let there be a large number of stochastic DNA 
sequences, produced as has been described and 

85 cloned in an expression vector. It follows that the 
RNA transcribed from these sequences in the 
transformed host cells can be useful products 
themselves. As a non limiting example, it is possible 
to select a stochastic gene coding for a suppressor 

90 transfer RNA (tRNA) by the following procedure :A_ 
lame number telQ*) of stochastic segu ing** are 
transformed into competent bacterial hosts carrying 
a "nonsens e" mutation in the ar p E. gene. These 
transformed bacteria are plated on minimal medium 

95 without arginine and with the selective antibiotic for 
that plasmid (ampicillin if the vector is pUC8). Only 
those transformed bacteria which have become 
capable of synthesizing arginine will be able to 
grow. This phenotype can result either from a back 
1 00 mutation of the host genome, or the presence in the 
^ cell of a suppressor. It is easy to test each 
transformed colony to determine if the arg + 
phenotype is or is not due to the presence of the 
stochastic gene in its vector; it suffices to purify the 
1 05 plasmid from this colony and verify that K confers 
an Arg+ phenotype on all arge E cells transformed 
by it. 

* 

Selection Of Proteins Capable Of Catalysing A 

110 Sequence Of Reactions 

Below we describe another means of selection, 
open to independent applications, based on the 
principle of simultaneous and parallel selection of a 
certain number of novel proteins capable of 

115 catalysing a connected sequence of reactions. 
The basic idea of this method is the following : 
given an initial ensemble of chemical compounds 
considered as building blocks or elements of 
construction from which it is hoped to synthetize 

1 20 one or several desired chemical compounds by 
m ansof acatalys d sequence of chemical 
reactions, there exists a very large number of 
reacti n routes which can be partially or completely 
substitut dfor nean ther, which are all 

1 25 thermodynamically possible, and which lead from 
the set of building blocks to the desired target 
compound(s). Efficient synthesis f a target 
comp undisfav red if each step of at least one 
reaction pathway leading f r m the building block 

130 comp undstothetarg t compound is comprised of 
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reactions each of which is catalysed. On the ther 
hand, it is relatively less important to determine 
which among the many independent or partially 
independent reaction pathways is catalysed. In the 

5 previous description, we have shown how it is 
possible to obtain a very targe number of host cells 
each of which expresses a distinct novel protein. 

Each of these novel proteins is a candidate to 
catalyse one or another of the possible reactions, in 

10 the set of all the possible reactions leading from the 
ensemble of building blocks to the target 
compound. If a sufficiently large number of 
stochastic proteins is present in a reaction mixture 
containing the building block compounds, such that 

15 a sufficiently large number of the possible reactions 
are catalysed, there is a high probability that one 
connected sequence of reactions leading from the 
set of building block compound to the target 
compound will be catalysed by a subset of the novel 

20 proteins. It is clear that this procedure can be 
extended to the catalysis not only of one, but of 
several target compounds simultaneously. 

Based on this principle it is possible to proceed as 
follows in order to select in parallel a set of novel 

25 proteins catalysing a desired sequence of chemical 
reactions: 

1 . Specify the desired set of compounds 
constituting the building blocks, utilizing 
preferentially a reasonably large number of distinct 

30 chemical species in order to increase the number of 
potential concurrent reactions leading to the desired 
target compound. 

2. Using an appropriate volume of reaction 
medium, add a very large number of novel 

35 stochastic proteins isolated from transformed or 
transfected cells synthesizing these proteins. Carry 
out an assay to determine if the target compound is 
formed. If it is, confirm that this formation requires 
the presence of the mixture of novel proteins. If so, 

40 then the mixture should contain a subset of proteins 
catalysing one or several reaction pathways leading 
from the building block set to the target compound. 
Purify and divide the initial ensemble of clones 
which synthesize the set of novel stochastic 

45 proteins, the subset which is required to catalyse the 
sequence of reactions leading to the target 
compound. 

More precisely, as a non limiting example, below 
we describe selection of novel proteins capable of 
50 catalysing the synthesis of a specific small peptide, 
in particular, a pentapeptide, starting from a 
building block set constituted of smaller peptides 
and amino acids. All peptides are constituted by a 
linear sequence of 20 different types Of amino acids, 
55 oriented from the amino to the carboxy terminus. 
Any peptide can be formed in a single st p by th 
terminal condensation ftw smaller peptid s( rof 
two amino acids), or by hydrolysis of a larger 
peptide. A peptide with M residu s can thus be 
60 formed by M-1 condensation r actions. The number 
of reactions, R, by which a set of peptides having 
I ngth 1, 2, 3, ... M residu scan be interc nverted 
is larger than the number of possible molecular 
speci s, T. This can be expressed as R/TsM-2. Thus, 



large number of independent or partially 
independent reaction pathways lead to the 
synthesis of a specific target peptide. Choose a 
pentapeptide whose presence can be determined 
70 conveniently by some common assay technique for 
example HPLC (liquid phase high pressure 
chromatography), paper chromatography, etc. 
Formation of a peptide bond requires energy in a 
dilute aqueous medium, but if the peptides 
75 participating in the condensation reactions are 
adequately concentrated, formation of peptide 
bonds is thermodynamically favored over 
hydrolysis and occurs efficiently in the presence of 
an appropriate enzymatic catalyst, for example 
80 pepsin or trypsin; without requiring the presence of 
ATP or other high energy compounds. Such a 
reaction mixture of small peptides whose amino 
acids are marked radioacti vely to act as tracers with 
3 H, ,4 C, M S, constituting the building block set can 
85 be used at sufficiently high concentrations to lead to 
condensation reactions. 

For example. It is feasible to proceed as follows: 
15 mg of each amino acid and small peptides having 
2 to 4 amino acids, chosen to constitute the building 
90 block set are dissolved in a volume of 0,25 ml to 1.0 
ml of a 0,1 M pH 7,6 phosphate buffer. A large 
number of novel proteins, generated and isolated as 
described above are purified from their bacterial or 
other host cells. The mixture of these novel proteins 
95 is dissolved to a final concentration on the order of 
0,8 to 1 ,0 mg/ml in the same buffer. 0,25 ml to 0,5 ml 
of the protein mixture is added to the mixture of 
building blocks. This is incubated at 25°C to 40°C for 
1 to 40 hours. Aliquots of 8 pj are removed at regular 
1 00 intervals, the first is used as a "blank" and taken 
before addition of the mixture of novel proteins. 
These aliquots are analysed by chromatography 
using n-butanol-acetic acid-pyridine-water 
(30:6:20:24 by volume) as the solvent. The 
1 05 chromatogram is dried and analysed by ninhydrin 
or autoradiography (with or without intensifying 
screens). Because the compound constituting the 
building block set are radioactively marked, the 
target compound will be radioactive and it will have 
110 specific activity high enough to permit detection at 
the level of 1—10 ng. In place of standard 
chromatographic analysis, it is possible to use HPLC 
(high pressure liquid chromatography) which is 
faster and simpler to carry out. More generally, all 
115 the usual analytic procedures can be employed. 
Consequently it is possible to detect a yield of the 
target compound of less than one part per million by 
weight compared to the compounds used as initial 
building blocks. 
120 If the pentapeptide is form d in the conditions 
described above, but not when an xtract is utilized 
which is derived from host cells transformed by an 
expression vector containing no stochastic genes, 
thef rmati n fthepentap ptideisnotth result of 
125 bacterial contaminants and thus r quires th 
presenc f a subset of the novel proteins in the 
reacti n mixture. 

The f flowing step consists in the separation of 
the particular subset of cells which contain 

• «■»»» ...... .,,r.U «Miinl n>^*n>r\C 
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catalysing the sequence of reactions leading to the 
target pentapeptide. As an example, if the number 
of reactions forming this sequence is 5. there are 
about 5 novel proteins which catalyse the necessary 
5 reactions. If the clone bank of bacteria containing 
the expression vectors which code for the novel 

genes has a number of distinct novel genes which is 
on the order of 1 'OOO'OOO all these expression 
vectors are isolated en masse and retransformed 
10 into 100 distinct sets of 10 s bacteria at a ratio of 
vectors to bacteria which is sufficiently low that on 
average, the number of bacteria in each set which 
are transformed is about half the number of initial 
genes, i.e. about SOO'OOO. Thus, the probability that 
1 5 any given one of the 1 00 sets of bacteria contains 
the entire set of 5 critical novel proteins is 11/2) -1/ 
32. Among the 100 initial sets of bacteria, about 3 
will contain the 5 critical transformants. In each of 
these sets, the total number of new genes is only 
20 BOO'OOO rather than 1 '000'000. By successive 
repetitions the total number of which is about 20 in 
the present case, this procedure isolates the 5 
critical novel genes. Following this, mutagenesis 
and selection on this set of 5 stochastic genes allows 
25 improvement of the necessary catalytic functions. In 
a case where it is necessary to catalyse a sequence 
of 20 reactions and 20 genes coding novel proteins 
need to be isolated in parallel, it suffices to adjust 
the multiplicity of transformation such that each set 
30 of 10* bacteria receives 80% of the 10 6 stochastic 
genes, and to use 200 such sets of bacteria. The 
probability that all 20 novel proteins are found in 
one set is 0.8 w s0,01 6. Thus, about 2 among the 200 
sets will have the 20 novel genes which are needed 
35 to catalyse the formation of the target compound. 
The number of cycles required for isolation of the 20 
novel genes is on the order of 30. 

The principles and procedures described above 
generalize from the case of peptides to numerous 
40 areas of chemistry in which chemical reactions take 
place in aqueous medium, in temperature, pH, and 
concentration conditions which permit general 
enzymatic function. In each case it is necessary to 
make use of an assay method to detect the 
45 formation of the desired target compound(s). It is 
also necessary to choose a sufficiently large number 
of building block compounds to augment the 
number of reaction sequences which lead to the 
target compound. 
50 The concrete example which was given for tne 
synthesis of a target pentapeptide can also be 
generalized as follows: 

The procedure as described, generates among 
other pr ducts, stochastic peptides and proteins. 
55 Th s peptides or proteins can act, catalytically or in 
ther ways, on other compounds. They can equally 
c nstitute the substrates n which they act. Thus, It 
isp ssibte to select ( r screen) forth capacity of 
such stochastic peptides rpr teinst interact 
60 among themselves and thereby modify the 

conf rmati n, the structure or the functi nofs me 
em ng them. Similarly, it is possible to select lor 
scr n) for the capacity of these peptides and 
prot ins to catalyse among themselves, hydrolysis. 



modifying the peptides. For example, the hydrolysis 
of a given stochastic peptide by at least one member 
of the set of stochastic peptides and proteins can be 
followed and measured by radioactive marking of 
70 the given protein followed by an incubation with a 
mixture of the stochastic proteins in the presence of 
ions such as Mg. Ca, Zn, Fe and ATP or GTP. The 
appearance of radioactive fragments of the marked 
protein is then measured as described. The 
75 stochastic protein(s) which catalyse this reaction 
can again be isolated* along with the genets) 
producing them, by sequential diminution of the 
library of transformed clones, as described above. 
An extension of the procedure consists in the 
80 selection of an ensemble of stochastic peptides and 
polypeptides capable of catalysing a set of reactions 
leading from the initial building blocks (amino acids 
and small peptides) to some of the peptides or 
polypeptides of the set. It is therefore also possible 
85 to select an ensemble capable of catalysing its own 
synthesis; such a reflexively autocatalytic set can be 
established in a chemostat where the products of 
the reactions are constantly diluted, but where the 
concentration of the building blocks is maintained 
90 constant Alternatively, synthesis of such a set is 
aided by enclosing the complex set of peptides in 
liposomes by standard techniques. In a hypertonic 
aqueous environment surrounding such liposomes, 
condensation reaction forming larger peptides 
95 lowers the osmotic pressure inside the liposomes, 
. drives water molecules produced by the 

condensation reactions out of the liposomes, hence 
favors synthesis of larger polymers. Existence of 
such an autocatalytic ensemble can be verified by 
100 two dimensional gel electrophoresis and by HPLC, 
showing the synthesis of a stable distribution of 
peptides and polypeptides. The appropriate reaction 
volume depends on the number of molecular 
species used, and the concentrations necessary to 
105 favor the formation of peptide bonds over their 
hydrolysis. The distribution of molecular species of 
an autocatalytic ensemble is free to vary or change 
due to the emergence of variant autocatalytic 
ensembles. The peptides and polypeptides which 
110 constitute an autocatalytic set may have certain 
elements in common with the large initial ensemble 
(constituted of coded peptides and polypeptides as 
given by our procedure) but can also contain 
peptides and polypeptides which are not coded by 
1 15 the ensemble of stochastic genes coding for the 
initial ensemble. 

The set of stochastic genes whose products are 
necessary to establish such an autocatalytic set can 
be isolated as has been described, by sequential 
120 diminution of the library of transform d clones. In 
addition, an autocatalytic set can contain coded 
peptid s initially coded by th stochastic genes and 
synthesiz d continuously in the autocatalytic set. To 
isolate this c ded subset fp ptides and prot .ns, 
125 theautocatyticsetcenbeus dt btain. through 
Immunization in an animal, p lyclonal sera 
recognizing a very large number f constituents of 

theaut catalytic set. f 
Theses ra can be utilized to screen th library 
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proteins ablet combine with the antib dies 
present in the sera. 

This set of stochastic genes expresses a large 
number of coded stochastic proteins which persist 
5 in the autocatalytic set. The remainder of the coded 
constituents of such an autocatalytic set can be 
isolated by serial diminution of the library of 
stochastic genes, from which the subset detected by 
immunological methods has first been subtracted. 
1 0 Such autocatalytic sets of peptides and proteins, 
obtained as noted, may find a number of practical 
applications. 

CLAIMS 

15 2) Process for the production o f peptides or 
polypeptides bv microbiological means, 
characterized in that genes which are at least 
partially composed of stochastic synthetic 
polynucleotides are produced simultaneously in a 

20 common milieu, that the genes thus obtained are 
introduced into host cells, that the independent 
clones of the transformed host cells containing 
these genes are simultaneously cultivated so as to 
clone the stochastic genes and lead to the 

25 production of proteins expressed by each of these 
stochastic genes, that screening and/or selection is 
carried out on such clones of transformed host cells 
to identify those clones producing peptides or 
polypeptides having at least one specified property, 

30 that the clones so identified are isolated, then grown 
in a manner so as to produce at least one peptide or 
polypeptide having the said property. 

2. Process according to claim 1, characterized by 
the fact that the genes are produced by stochastic 

35 copolymerization of the four types of 

deoxyphosphonucleotides A, C, G and T, starting 
from the two extremities of an expression vector 
which was previously linearized, then by formation 
of cohesive extremities to create a first strand of 

40 stochastic DNA constituted of a molecule of 
expression vector possessing two stochastic 
sequences whose 3' extremities are 
complementary, followed by synthesis of the 
second strand of the stochastic DNA. 

45 3. Process according to claim 1 , characterized by 
the fact that the genes are produced by stochastic 
copolymerization of double stranded 
oligonucleotides which do not have cohesive ends, 
in a manner so as to form fragments of stochastic 

50 ONA, followed by ligation of these fragments in an 
expression vector which was previously linearized. 

4. Process according to claim 2 or 3, characterized 
by the fact that the expression vector is a plasmid. 

5. Process according to claim 4, characterized by 
55 the fact that the expression vector is pUC8. 

6. Pr cess according to claim 2 or claim 3. 
characterized by the fact that expression vector is a 
fragment of viral DNA. 

7. Pr cess according t claim 2 r claim 3, 

60 characterized by the fact that the expression vector 
is a hybrid f plasmid and viral DNA. 

6. Process according to claim 1 1 6, characterized 
by the fact that the host cells are prokaryottc cells. 

9. Process according to claims 1 to 7 characterized 



10. Pr cess according to claim 8, characterized by 
the fact that the cells are chosen among H8101 and 
C600. 

1 1 . Process according to claim 3, characterized by 
70 the fact that the oligonucleotides form a group of 

palindromic octamers. 

12. Process according to claim 1 1 characterized by 
the fact that the group of palindromic octamers is 
the following group: 

75 

5' GGAATTCC3' 
5' GGTCGACC3' 
5' CAAGCTTG 3' 
5' CCATATGG 3 r 
80 5' CATCGATG 3' 

13. Process according to claim 3, characterized by 
the fact that the oligonucleotides form a group of 
palindromic heptamers. 

85 Process according to claim 13, characterized 
by the fact that the group of palindromic heptamers 
is the following group: 

5'XTCGCGA3' 
5' XCTGCAG 3' 
90 5'RGGTACC3' 

where X* A, G, C. or T, and R= A or T. 

15. Process according to claim 4 and one of the 
claim 12 or 14, characterized by the fact that one first 
95 isolates and purifies the transforming DNA derived 
from a culture of independent clones of transformed 
host cells obtained by proceeding in the manner 
specified in claim 1 1 or in claim 13, then that one 
cuts the purified DNA by means of at least one 

1 00 restriction enzyme which corresponds to a specific 
restriction site present in these palindromic 
octamers or heptamers, but absent from the 
expression vector being utilized, that one thereafter 
simultaneously treats the ensemble of linearized 

1 05 stochastic DNA fragments so obtained by T4 DNA 
ligase in such a manner as to create a new ensemble 
of DNA containing new stochastic sequences, and 
that one uses this new ensemble of transforming 
DNA to transform host cells and clone such genes, 

110 and finally that one carries out screening and/or 
selection and isolates the new clones of 
transformed celts and that one grows these so as to 
produce at least one peptide or polypeptide having 
a desired property. 

115 16. Process according to claim 1 , characterized by 
the fact that the said property is the capacity to 
catalyse a given chemical reaction. 

17. Process according to claim 1 for the 
production of several peptides and/or polypeptides, 

1 20 characterized by the fact that the said property is the 
capacity to catalyse a sequence of reactions leading 
from a given gr up f initial chemical compounds to 
at I ast one target compound. 

18. Pr cess according to claim 16 for the 

125 production of an ensemble consisting of more than 
one peptide and/or p lypeptide which is reflexively 
autocatalytic, charact rized by the fact that the said 
property is the capacity to catalyse the synthesis of 
the nsemble itself starting from amino acids and/or 
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19. Process according t claim 1 , characterized by 
the fact that the said property is the capacity to 
modify selectively the chemical and/or biological 
properties of a given compound. 
5 20. Process according to claim 19. characterized 
by the fact that the said property is the capacity to 
modify selectively the catalytic activity of a 
polypeptide. 

21. Process according to claim 19, characterized 
10 by the fact that the said property is the capacity to 

simulate or modify at least one biological function 
of at least one biologically active compound. 

22. Process according to claim 21, characterized 
by the fact that the said biologically active 

1 6 compound is chosen among the hormones, 

neurotransmitters, adhesion or growth factors, and 
specific regulators or replication and/or 
transcription of DNA, and/or translation of RN A. 

23. Process according to claim 1, characterized by 
20 the fact that the said property is the capacity to bind 

to a given ligand. 

24. Utilization of a peptide or polypeptide 
obtained by the process according to claim 23 for 
the detection and/or titration of the ligand. 

25 25. Process according to claim 1 , characterized by 
the fact that the property is to have at least one 
epitope similar to one of the epitopes of a given 
antigen. 

26. Process according to claim 19 and 25, 

30 characterized by the fact that the said property Is the 
capacity to simulate or modify the effects of a 
biologically active molecule, and that screening and/ 
or selection of the clones of transformed host cells 
producing at least one peptide or polypeptide 

35 having this property is carried out by preparing 
antibodies against that molecule, and utilizing these 
antibodies so obtained to identify those clones 
containing those peptides or polypeptides, then by 
growing the clones thus identified and separating 

40 and purifying the peptide or polypeptide produced 
by these clones, and finally by submitting these 
peptide(s) or polypeptide(s) to an assay in vitro to 
verify that it has in fact the capacity to simulate or 
modify the effects of the said molecule. 

45 27. Peptides or polypeptides obtained by the 
process according to claim 1 or claim 26, utilizable 
as active substances having a pharmacologic and/or 
chemotherapeutic action. 
28. Peptides or polypeptides, obtained by the 

50 process according to claim 25, utilizable to diminish, 
in vitro or in vivo, the concentration of free 
antibodies, specific for the said antigen, by 
formation of bonds between these peptides or 
polypeptides and these antibodies. 

55 29. Peptides rpolypeptid s according to claims 
27 r 28, utilizable as ag ntstosuppr ss 
immunological hypersensitivity. 

30. P ptid s rpolypeptid s, btain d by the 
process according to claim 25, utilizable as agents to 

60 create tol ranee with respect to the said antigen. 

31. Proc ss according t claim 25, characterized 
by the fact that the antigen is EGF. 

32. Peptides or polypeptides obtained by the 
orocess according to claim 31. utilizabl for the 



33. Proc ssacc rding to claim 1, characterized by 
the fact that cl nes f transformed host cells 
producing peptides or polypeptides having the 
desired property are identified and isolated by 

70 affinity chromatography on antibodies 

corresponding to a protein expressed by the natural 
fragment of the DNA hybrid. 

34. Process according to claim 33, characterized 
by the fact that the natural f ragmentof the DNA 

75 hybrid contains a gene expressing p-galactosidase, 
and that one identifies and isolates the said clones 
of transformed cell hosts by affinity 
chromatography with anti-0-galactosidase 

antibodies. 

80 35. Process according to claim 1 , or claim 34, 
characterized by the fact that after expression and 
purification of the hybrid peptides or polypeptides, 
the novel fragments are separated and isolated. 

36. Application of the process according to claim 
85 25 or claim 26, for the preparation of a vaccine, 

characterized by the fact that antibodies against a 
pathogenic agent are obtained and used to identify 
those clones producing at least one protein having 
at least one epitope similar to one of the epitopes of 
90 the pathogenic agent, that the corresponding clones 
of transformed host cells are grown in such a 
manner as to produce this protein, that the protein is 
isolated and purified from the cultures of clones of 
cells and that this protein is used for the production 
95 of a vaccine against the pathogenic agent. 

37. Application according to claim 36, for the 
preparation of an anti-HVB vaccine, characterized by 
the fact that at least one capsid protein of the HVB 
virus is extracted and purified, that this protein is 

100 injected into the body of an animal capable of 
forming antibodies against this protein, that these 
antibodies are recovered and purified, that these 
antibodies are used to identify those clones 
producing at least one protein having at least one 
105 epitope similar to one of the epitopes of the HVB 
virus, that the clones of transformed host cells 
corresponding to these clones are grown in a 
manner to produce this protein, that this protein is 
isolated and purified from these cultures of host 
1 1 0 cells, and that this protein is used for the production 
of an anti-HVB vaccine. 

38. Process according tm claim 1, characterized by 
the fact that the host cells consist in bacteria of 
Escherichia coli type, whose genome contains 
115 neither the natural P-galactosidase gene, nor the 
EBG gene, that is 2", EGB" E coli, and that these 
transformed host cells are cultured in an X-gal 
medium also containing the inducer IPTG, that 
clones which are positive for p-galectosidase 
120 fundi n are detected in the culture milieu, that 

thereafter this DNA is transplanted t a clone of host 
cells appropriate f r industrial production f at least 
on peptide, polypeptide or protein with p- 
galactosidase function. 
125 39. Process acc rding t claim 1,charactenzedt)Y 
the fact that the said pr party is the capacity to bind 
to a given compound. 

40. Pr c ss acc rding to claim 39. characterized 
by the fact that th said compound is chosen among 
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41. Pr cess according t claim 40. characterized 
by the fact that the said proteins are pr terns 
regulating the transcription activity or replication of 
DNA. 

5 42. Process according to claim 39, characterized 
by the fact that the said compound is chosen among 
the sequences of ONA and RNA. 

43. Proteins obtained by the process according to 
claim 40 or claim 42. 

1 0 (^Process for the production of ON A, 

characterized by the fact that, in the same milieu, 
genes which are at least partially composed of 
stochastic synthetic polynucleotides are produced, 
that the genes so produced fere introduced into host 

15 gells in a manner to produce an ensemble of 

transformed hngf CfiU& that thftse^m grown SO as to 

_ produce independent clones of the host cells so 
produced, that s cr y ning and ter. sele ction is carried 
out on this ensemble |p identify those host ceils 
20 which contain those stochastic sequences of ONA 
having at least one desired property, an d that such 
PNA is isolated from the identified cultures of the 
host cells. 

45. Process according to claim 44, characterized 
25 by the fact that the said property is the capacity to 

bind to given compound. 

46. Process according to claim 45, characterized 
by the fact that the said compound is chosen among 
the peptides, polypeptides and proteins. 

30 47. Process according to claim 45, characterized 
by the fact that the said compound is a compound 
regulating the transcription activity or the 
replication of ONA. 
48. Process according to claim 47, characterized 

35 by the fact that the said compound is a regulatory 
protein controlling the transcription or replication of 
ONA. 



49. Utilizati n of a sequence fONA btainedby 
thepr cess according to claim 46, r claim 47, as a 

40 cis-regutatory sequence of replication or 

transcription of a neighboring sequ nee of DNA. 

50. Process according to claim 42, characterized 
by the fact that the proteins obtained have the 
capacity to modify the transcription activity, the 

45 replication, or the stability of DNA. 

51. Utilization of a protein obtained by the process 
according to claim 48, to modify the transcription, 
replication or stability of a sequence of DNA in a cell 
containing this sequence of ONA and expressing 

50 thisprotein. 

^52) Process for the production of RNA, 
characterized by the fact that, in the same milieu, 
genes which are at least partially composed of 
. synthetic stochastic polynucleotides are produced 

55 eimuitaneouslv , that the genes th *** ^fa**'"*** * rp 
introduced in host cells in a manner to pr oduce 
ensemble of transformed host ce lls /hat ifr e 
independent clones of transformed host cells so 
produced are grown simultaneously , that the . 

60 screening and/or selection Is carried out on this 
ensemble in a manner to identify those host celts 
containing stochastic sequences of RNA havinp at 
least one desired property, and that the RNA is 
isolated from the cultures of host cells so identified. 

65 53. Process according to claim 52, characterized 
by the fact that the said property is the capacity to 
bind to a given compound. 

54. Process according to claim 52, characterized 
by the fact that the said property is the capacity to 

70 catalyse a given chemical reaction. 

55. Process according to claim 52, characterized 
by the fact that the said property is to be a transfer 
RNA. 
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